SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Caras, Ingrid W 
(ii) TITLE OF INVENTION: A2-1 Neurotrophic Factor 
(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/63513 0 

(B) FILING DATE: 19-Mar-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Torchia, PhD., Timothy E. 

(B) REGISTRATION NUMBER: 3 6,700 

(C) REFERENCE /DOCKET NUMBER: PlOOl 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-8674 

(B) TELEFAX: 650/952-9881 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1877 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ix) FEATURE: 

(A) NAME/KEY: Extra Cellular Domain 

(B) LOCATION: 244-899 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 



(ix) FEATURE: 

(A) NAME/KEY: Transmembrane Domain 

(B) LOCATION: 901-978 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: signal peptide 

(B) LOCATION: 244-321 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC GACGGCCCCT 50 

GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC GCGGGCACAG 100 

CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC CGGGGTGGTC 150 

CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCCGGGG GGTGGGCGAC 200 

TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTTGGCGGG GTC ATG 246 

Met 
1 

GGG CCC CCC CAT TCT GGG CCG GGG GGC GTG CGA GTC GGG 2 85 
Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly 
5 10 

GCC CTG CTG CTG CTG GGG GTT TTG GGG CTG GTG TCT GGG 324 
Ala Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly 
15 20 25 

CTC AGC CTG GAG CCT GTC TAC TGG AAC TCG GCG AAT AAG 3 63 
Leu Ser Leu Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys 
30 35 40 

AGG TTC CAG GCA GAG GGT GGT TAT GTG CTG TAC CCT CAG 402 
Arg Phe Gin Ala Glu Gly Gly Tyr Val Leu Tyr Pro Gin 

45 50 

ATC GGG GAC CGG CTA GAC CTG CTC TGC CCC CGG GCC CGG 441 
lie Gly Asp Arg Leu Asp Leu Leu Cys Pro Arg Ala Arg 
55 60 65 

CCT CCT GGC CCT CAC TCC TCT CCT AAT TAT GAG TTC TAC 480 
Pro Pro Gly Pro His Ser Ser Pro Asn Tyr Glu Phe Tyr 
70 75 



AAG CTG TAG CTG GTA GGG GGT GCT GAG GGC GGG GGG TGT 519 
Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg Arg Cys 
80 85 90 



GAG GCA CCC CCT GCC CCA AAC CTC CTT CTC ACT TGT GAT 558 
Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 
95 100 105 

CGC CCA GAC CTG GAT CTC CGC TTC ACC ATC AAG TTC CAG 597 
Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin 

110 115 

GAG TAT AGC CCT AAT CTC TGG GGC CAC GAG TTC CGC TCG 63 6 
Glu Tyr Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser 
120 125 130 

CAC CAC GAT TAG TAG ATC ATT GCC ACA TCG GAT GGG ACC 675 
His His Asp Tyr Tyr lie lie Ala Thr Ser Asp Gly Thr 
135 140 

CGG GAG GGC CTG GAG AGC CTG CAG GGA GGT GTG TGC CTA 714 
Arg Glu Gly Leu Glu Ser Leu Gin Gly Gly Val Cys Leu 
145 150 155 

ACC AGA GGC ATG AAG GTG CTT CTC CGA GTG GGA CAA AGT 753 
Thr Arg Gly Met Lys Val Leu Leu Arg Val Gly Gin Ser 
160 165 170 

CCC CGA GGA GGG GCT GTC CCC CGA AAA CCT GTG TCT GAA 792 
Pro Arg Gly Gly Ala Val Pro Arg Lys Pro Val Ser Glu 

175 180 

ATG CCC ATG GAA AGA GAC CGA GGG GCA GCC CAC AGC CTG 831 
Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser Leu 
185 190 195 

GAG CCT GGG AAG GAG AAC CTG CCA GGT GAC CCC ACC AGC 870 
Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser 
200 205 

AAT GCA ACC TCG CGG GGT GCT GAA GGC CCC CTG CCC CCT 909 
Asn Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro 
210 215 220 

CCC AGC ATG CCT GCA GTG GCT GGG GCA GCA GGG GGG CTG 948 
Pro Ser Met Pro Ala Val Ala Gly Ala Ala Gly Gly Leu 
225 230 235 



GCG CTG CTC TTG CTG GGC GTG GCA GGG GCT GGG GGT GCC 987 
Ala Leu Leu Leu Leu Gly Val Ala Gly Ala Gly Gly Ala 



240 



245 



ATG TGT 
Met Cys 
250 

CGC CAC 
Arg His 



CTG GGC 
Leu Gly 
275 

GOT GAG 
Ala Glu 



GGG GOT 
Gly Ala 



GTG AGT 
Val Ser 
315 

GAT GGG 
Asp Gly 



TCG ATT 
Ser lie 
340 

CAA CTG 
Gin Leu 



ACA TTC 
Thr Phe 



TGC CGG 
Cys Arg 
380 



TGG CGG 
Trp Arg 



CCT GGT 
Pro Gly 
265 

CTG GGG 
Leu Gly 



CCT GGG 
Pro Gly 
290 

GCA GAT 
Ala Asp 



GGT GAC 
Gly Asp 



CCC CCC 
Pro Pro 
330 

TCT GTG 
Ser Val 



TTT TTC 
Phe Phe 
355 

TTA TTT 
Leu Phe 



ATG ACT 
Met Thr 



AGA CGG 
Arg Arg 



CCT GGC 
Pro Gly 



GGT GGA 
Gly Gly 
280 

GAG CTA 
Glu Leu 



CCC CCC 
Pro Pro 
305 

TAT GGG 
Tyr Gly 



CAG AGC 
Gin Ser 



TTG GAG 
Leu Glu 
345 

ATG CGA 
Met Arg 



CCT GTG 
Pro Val 
370 

TCA TTT 
Ser Phe 



CGG GCC 
Arg Ala 
255 

TCC TTC 
Ser Phe 



GGT GGG 
Gly Gly 



GGG ATA 
Gly lie 
295 

TTC TGC 
Phe Cys 



CAT CCT 
His Pro 
320 

CCT CCA 
Pro Pro 



TGG CCC 
Trp Pro 



TCC AAG 
Ser Lys 
360 

CAA GTT 
Gin Val 



AGC TTT 
Ser Phe 
385 



AAG CCT 
Lys Pro 



GGG AGG 
Gly Arg 
270 

ATG GGA 
Met Gly 



GCT CTG 
Ala Leu 



CCC CAC 
Pro His 
310 

GTG TAT 
Val Tyr 



AAC ATC 
Asn lie 
335 

ATA TTG 
lie Leu 



TGC TCC 
Cys Ser 



ATT ACG 
lie Thr 
375 

ACC ACC 
Thr Thr 



TCG GAG 
Ser Glu 
260 

GGA GGG 
Gly Gly 



CCT CGG 
Pro Arg 
285 

CGG GGT 
Arg Gly 



TAT GAG 
Tyr Glu 



ATC GTG 
lie Val 
325 

TAC TAC 
Tyr Tyr 



CAT ACG 
His Thr 
350 

CGT GTC 
Arg Val 



ACA TCG 
Thr Ser 



CTG AAC 
Leu Asn 
390 



AGT 102'6 
Ser 



TCT 1065 
Ser 



GAG 1104 
Glu 



GGC 1143 

Gly 

300 

AAG 1182 
Lys 



CAG 1221 
Gin 



ACA 1260 
Thr 



ATA 1299 

He 



ACT 133 8 

Thr 

365 

ACT 1377 
Thr 



CCA 1416 
Pro 



TCC ATG CAG GCC TGC AGA GCA CAG ATG GGG GAA TTC CGA 1455 
Ser Met Gin Ala Cys Arg Ala Gin Met Gly Glu Phe Arg 
395 400 



ATC AGA TGG TGT TTC TGG GGG GAC AGG ATC CTG GGT ACG 1494 
lie Arg Trp Cys Phe Trp Gly Asp Arg lie Leu Gly Thr 
405 410 415 

GCT CTG TTT GTG CTT GTG CTT ATT CTT CTT CTT GGG AGG 1533 
Ala Leu Phe Val Leu Val Leu lie Leu Leu Leu Gly Arg 
420 425 430 

CTG AAT ATG CAT CAG ACG ACA CTG CTC CGG CAA CGG GCC 1572 
Leu Asn Met His Gin Thr Thr Leu Leu Arg Gin Arg Ala 

435 440 

AGT GTG GAG GCG GAA GCC GGC CAG CAT GGT CCC CTG TG 1610 
Ser Val Glu Ala Glu Ala Gly Gin His Gly Pro Leu 
445 450 455 

ATAGGATTGA AAGAGCTACT GAGAATAGGG GGCTTCTCAA TGAGAGAGCG 1660 

GAGGCTGCTG TTATCATGGG AACCAGGCAG ATCAATCATC CCTGGCAGGT 1710 

CAGGCAGGAA GTTACTTAGC TTCTCCTTCA CCTTCTTCCC ACAGAATTTA 1760 

TTATAGGCTT GTTCCAAGTT GTAGTGTGTG ATCAGATTCG TGCTGCCTGT 1810 

CAGCTCTGTG CTACCTGGCA GTTCCCCTCA TGGAATTCGA TATCAAGCTT 1860 

ATCGATACCG TCGACCT 1877 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly Ala 
15 10 15 

Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly Leu Ser Leu 

20 25 30 

Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys Arg Phe Gin Ala Glu 

35 40 45 

Gly Gly Tyr Val Leu Tyr Pro Gin lie Gly Asp Arg Leu Asp Leu 

50 55 60 



Leu Cys Pro Arg Ala Arg Pro Pro Gly Pro His Ser Ser Pro Asn 



65 70 75 

Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg 

80 85 90 

Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 

95 100 105 

Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 

110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 

125 130 135 

Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leu 

155 160 165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg Lys Pro 

170 175 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser 

185 190 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser Asn 

200 205 210 

Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 220 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 

230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 

245 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 

260 265 270 

Arg Gly Gly Ser Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro 

275 280 285 

Arg Glu Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 295 300 



Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 

305 310 315 



Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 

320 325 330 



Gin Ser Pro Pro Asn lie Tyr Tyr Thr Ser lie Ser Val Leu Glu 

335 340 345 

Trp Pro lie Leu His Thr lie Gin Leu Phe Phe Met Arg Ser Lys 

350 355 360 

Cys Ser Arg Val Thr Thr Phe Leu Phe Pro Val Gin Val lie Thr 

365 370 375 

Thr Ser Thr Cys Arg Met Thr Ser Phe Ser Phe Thr Thr Leu Asn 

380 385 390 

Pro Ser Met Gin Ala Cys Arg Ala Gin Met Gly Glu Phe Arg lie 

395 400 405 

Arg Trp Cys Phe Trp Gly Asp Arg lie Leu Gly Thr Ala Leu Phe 

410 415 420 

Val Leu Val Leu lie Leu Leu Leu Gly Arg Leu Asn Met His Gin 

425 430 435 

Thr Thr Leu Leu Arg Gin Arg Ala Ser Val Glu Ala Glu Ala Gly 

440 445 450 

Gin His Gly Pro Leu 

455 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2380 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC GACGGCCCCT 50 
GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC GCGGGCACAG 100 
CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC CGGGGTGGTC 150 
CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCCGGGG GGTGGGCGAC 200 



TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTTGGCGGG GTCATGGGGC 250 



CCCCCCATTC TGGGCCGGGG GGCGTGCGAG TCGGGGCCCT GCTGCTGCTG 300 
GGGGTTTTGG GGCTGGTGTC TGGGCTCAGC CTGGAGCCTG TCTACTGGAA 350 
CTCGGCGAAT AAGAGGTTCC AGGCAGAGGG TGGTTATGTG CTGTACCCTC 400 
AGATCGGGGA CCGGCTAGAC CTGCTCTGCC CCCGGGCCCG GCCTCCTGGC 450 
CCTCACTCCT CTCCTAATTA TGAGTTCTAC AAGCTGTACC TGGTAGGGGG 500 
TGCTCAGGGC CGGCGCTGTG AGGCACCCCC TGCCCCAAAC CTCCTTCTCA 550 
CTTGTGATCG CCCAGACCTG GATCTCCGCT TCACCATCAA GTTCCAGGAG 600 
TATAGCCCTA ATCTCTGGGG CCACGAGTTC CGCTCGCACC ACGATTACTA 650 
CATCATTGCC ACATCGGATG GGACCCGGGA GGGCCTGGAG AGCCTGCAGG 700 
GAGGTGTGTG CCTAACCAGA GGCATGAAGG TGCTTCTCCG AGTGGGACAA 750 
AGTCCCCGAG GAGGGGCTGT CCCCCGAAAA CCTGTGTCTG AAATGCCCAT 800 
GGAAAGAGAC CGAGGGGCAG CCCACAGCCT GGAGCCTGGG AAGGAGAACC 850 
TGCCAGGTGA CCCCACCAGC AATGCAACCT CCCGGGGTGC TGAAGGCCCC 900 
CTGCCCCCTC CCAGCATGCC TGCAGTGGCT GGGGCAGCAG GGGGGCTGGC 950 
GCTGCTCTTG CTGGGCGTGG CAGGGGCTGG GGGTGCCATG TGTTGGCGGA 1000 
GACGGCGGGC CAAGCCTTCG GAGAGTCGCC ACCCTGGTCC TGGCTCCTTC 1050 
GGGAGGGGAG GGTCTCTGGG CCTGGGGGGT GGAGGTGGGA TGGGACCTCG 1100 
GGAGGCTGAG CCTGGGGAGC TAGGGATAGC TCTGCGGGGT GGCGGGGCTG 1150 
CAGATCCCCC CTTCTGCCCC CACTATGAGA AGGTGAGTGG TGACTATGGG 12 00 
CATCCTGTGT ATATCGTGCA GGATGGGCCC CCCCAGAGCC CTCCAAACAT 1250 
CTACTACAAG GTATGAGGGC TCCTCTCACG TGGCTATCCT GAATCCAGCC 13 00 
CTTCTTGGGG TGCTCCTCCA GTTTAATTCC TGGTTTGAGG GACACCTCTA 13 50 
ACATCTCGGC CCCCTGTGCC CCCCCAGCCC CTTCACTCCT CCCGGCTGCT 1400 
GTCCTCGTCT CCACTTTTAG GATTCCTTAG GATTCCCACT GCCCCACTTC 1450 
CTGCCCTCCC GTTTGGCCAT GGGTGCCCCC CTCTGTCTCA GTGTCCCTGG 1500 



ATCCTTTTTC CTTGGGGAGG GGCACAGGCT CAGCCTCCTC TCTGACCATG 1550 
ACCCAGGCAT CCTTGTCCCC CTCACCCACC CAGAGCTAGG GGCGGGAACA 1600 
GCCCACCTTT TGGTTGGCAC CGCCTTCTTT CTGCCTCTCA CTGGTTTTCT 1650 
CTTCTCTATC TCTTATTCTT TCCCTCTCTT CCGTCTCTAG GTCTGTTCTT 1700 
CTTCCCTAGC ATCCTCCTCC CCACATCTCC TTTCACCCTC TTGGCTTCTT 1750 
ATCCTGTGCC TCTCCCATCT CCTGGGTGGG GGCATCAAAG CATTTCTCCC 1800 
CTTAGCTTTC AGCCCCCCTT CTGACCTCTC ATACCAACCA CTCCCCTCAG 1850 
TCTGCCAAAA ATGGGGGCCT TATGGGGAAG GCTCTGACAC TCCACCCCAG 1900 
CTCAGGCCAT GGGCAGCAGG GCTCCATTCT CTGGCCTGGC CCAGGCCTCT 1950 
ACATACTTAC TCCAGCCATT TGGGGTGGTT GGGTCATGAC AGCTACCATG 2000 
AGAAGAAGTG TCCCGTTTTG TCCAGTGGCC AATAGCAAGA TATGAACCGG 2050 
TCGGGACATG TATGGACTTG GTCTGATGCT GAATGGGCCA CTTGGGACCG 2100 
GAAGTGACTT GCTCCAGACA AGAGGTGACC AGGCCCGGAC AGAAATGGCC 2150 
TGGGAAGTAG CAGAAGCAGT GCAGCAGGAA CTGGAAGTGC CTTCATCCAG 2200 
GACAGGAAGT AGCACTTCTG AAACAGGAAG TGGTCTGGCT GGAACTCCAA 2250 
GTGGCTTAGT CTGGGGGATC AGGAGGTGGG AGGTGGATGG TTCTTATTCT 2300 
GTGGAGAAGA AGGGCGGGAA GAACTTCCTT TCAGGAGGAA GCTGGAACTT 2350 
ACTGACTGTA AGAGGTTAGA GGTGGACCGA 23 80 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly Ala 
15 10 15 

Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly Leu Ser Leu 

20 25 30 




Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys Arg Phe Gin Ala Glu 

35 40 45 

Gly Gly Tyr Val Leu Tyr Pro Gin lie Gly Asp Arg Leu Asp Leu 



Leu Cys Pro Arg Ala Arg Pro Pro Gly Pro His Ser Ser Pro Asn 

65 70 75 

Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg 

80 85 90 

Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 

95 100 105 

Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 

110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 

125 130 135 

Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leu 

155 160 165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg Lys Pro 

170 175 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser 

185 190 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser Asn 

200 205 210 

Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 220 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 

230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 

245 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 



260 



265 



270 



Arg Gly Gly Ser Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro 

275 280 285 



Arg Glu Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly.Gly 

290 295 300 



Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 

305 310 315 

Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 

320 325 330 

Gin Ser Pro Pro Asn lie Tyr Tyr Lys Val 

335 340 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



GCCGACGCTG TGAGGCACCC CCTGCCCCAA ACCTCCTTCT CACTTGTGAT 50 
CGCCCAGACC TGGATCTCCG CTTCACCATC AAGTTCCAGG AGTATAGCCC 100 
TAATCTCTGG GGCCACGAGT TCCGCTCGCA CCACGATTAC TACATCATTG 150 
CCACATCGGA TGGGACCCGG GAGGCCTGGG AGAGCCTGCA GGGAAGTGTG 200 
TGCCTAACCA GAGGCATGAA GGTGCTTCTC CGAGTNGGAC AAAGTCCCGA 250 
GGAGGGGCTG TCCCCCGAAA ACCTGTGTCT GAAATGCCCA TGGAAAGAGA 300 
CCGAGGGGCA GCCCACAGCC TGGGAGCCTG GGGAAGGAGA ACCTGCCAGG 350 
TGACCCCACC AGCAATNCAA CCTTCCGGGG TTGCTTGAAG GGCCCCTTGA 400 
CCCTTTCCCA GCATTGCNTG CANTTGGTTN GGGGCAGCAN GGGGGNGTTT 450 
TGGC 454 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 




(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 6: 



Asp Lys Thr His Thr Cys Pro Pro 
1 5 



Cys Pro 
10 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



GGACAAAGTC CCGAGGAGGG GCTGTCCCCC GAAAACCTGT GTCTGAAATG 50 
CCCATGGAAA 60 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



CAGGTTCTCC TTCCCCAGGC TCCCAGGCTG TGGGCTGCCC CTCGGTCTCT 50 
TTCCATGGGC 60 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Ala Arg Pro Gly Gin Arg Trp Leu Gly Lys Trp Leu Val Ala 
15 10 15 



Met Val Val Trp Ala Leu Cys Arg Leu Ala Thr Pro Leu Ala Lys 

20 25 30 



• 



Asn Leu Glu Pro Val Ser Trp Ser Ser Leu Asn Pro Lys Phe Leu 

35 40 45 

Ser Gly Lys Gly Leu Val lie Tyr Pro Lys lie Gly Asp Lys Leu 

50 55 60 

Asp lie lie Cys Pro Arg Ala Glu Ala Gly Arg Pro Tyr Glu Tyr 

65 70 75 

Tyr Lys Leu Tyr Leu Val Arg Pro Glu Gin Ala Ala Ala Cys Ser 

80 85 90 

Thr Val Leu Asp Pro Asn Val Leu Val Thr Cys Asn Arg Pro Glu 

95 100 105 

Gin Glu lie Arg Phe Thr lie Lys Phe Gin Glu Phe Ser Pro Asn 

110 115 120 

Tyr Met Gly Leu Glu Phe Lys Lys His His Asp Tyr Tyr lie Thr 

125 130 135 

Ser Thr Ser Asn Gly Ser Leu Glu Gly Leu Glu Asn Arg Glu Gly 

140 145 150 

Gly Val Cys Arg Thr Arg Thr Met Lys lie lie Met Lys Val Gly 

155 160 165 

Gin Asp Pro Asn Ala Val Thr Pro Glu Gin Leu Thr Thr Ser Arg 

170 175 180 

Pro Ser Lys Glu Ala Asp Asn Thr Val Lys Met Ala Thr Gin Ala 

185 190 195 

Pro Gly Ser Arg Gly Ser Leu Gly Asp Ser Asp Gly Lys His Glu 

200 205 210 

Thr Val Asn Gin Glu Glu Lys Ser Gly Pro Gly Ala Ser Gly Gly 

215 220 225 

Ser Ser Gly Asp Pro Asp Gly Phe Phe Asn Ser Lys Val Ala Leu 

230 235 240 

Phe Ala Ala Val Gly Ala Gly Cys Val lie Phe Leu Leu lie lie 

245 250 255 

lie Phe Leu Thr Val Leu Leu Leu Lys Leu Arg Lys Arg His Arg 



260 



265 



270 



Lys His Thr Gin Gin Arg Ala Ala Ala Leu Ser Leu Ser Thr Leu 

275 280 285 



Ala Ser Pro Lys Gly Gly Ser Gly Thr Ala Gly Thr Glu Pro Ser 

290 295 300 

Asp lie lie lie Pro Leu Arg Thr Thr Glu Asn Asn Tyr Cys Pro 

305 310 315 

His Tyr Glu Lys Val Ser Gly Asp Tyr Gly His Pro Val Tyr lie 

320 325 330 

Val Gin Glu Met Pro Pro Gin Ser Pro Ala Asn lie Tyr Tyr Lys 

335 340 345 

Val 
346 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Val Arg Arg Asp Ser Val Trp Lys Tyr Cys Trp Gly Val 
15 10 15 

Leu Met Val Leu Cys Arg Thr Ala lie Ser Lys Ser lie Val Leu 

20 25 30 

Glu Pro lie Tyr Trp Asn Ser Ser Asn Ser Lys Phe Leu Pro Gly 

35 40 45 

Gin Gly Leu Val Leu Tyr Pro Gin lie Gly Asp Lys Leu Asp lie 

50 55 60 

lie Cys Pro Lys Val Asp Ser Lys Thr Val Gly Gin Tyr Glu Tyr 

65 70 75 

Tyr Lys Val Tyr Met Val Asp Lys Asp Gin Ala Asp Arg Cys Thr 

80 85 90 

lie Lys Lys Glu Asn Thr Pro Leu Leu Asn Cys Ala Lys Pro Asp 

95 100 105 

Gin Asp lie Lys Phe Thr lie Lys Phe Gin Glu Phe Ser Pro Asn 

110 115 120 



Leu Trp Gly Leu Glu Phe Gin Lys Asn Lys Asp Tyr Tyr lie lie 

125 130 135 



Ser Thr Ser Asn Gly Ser Leu Glu Gly Leu Asp Asn Gin Glu Gly 

140 145 150 



Gly Val Cys Gin Thr Arg Ala Met Lys lie Leu Met Lys Val Gly 

155 160 165 

Gin Asp Ala Ser Ser Ala Gly Ser Thr Arg Asn Lys Asp Pro Thr 

170 175 180 

Arg Arg Pro Glu Leu Glu Ala Gly Thr Asn Gly Arg Ser Ser Thr 

185 190 195 

Thr Ser Pro Phe Val Lys Pro Asn Pro Gly Ser Ser Thr Asp Gly 

200 205 210 

Asn Ser Ala Gly His Ser Gly Asn Asn lie Leu Gly Ser Glu Val 

215 220 225 

Ala Leu Phe Ala Gly lie Ala Ser Gly Cys lie lie Phe lie Val 

230 235 240 

lie lie lie Thr Leu Val Val Leu Leu Leu Lys Tyr Arg Arg Arg 

245 250 255 

His Arg Lys His Ser Pro Gin His Thr Thr Thr Leu Ser Leu Ser 

260 265 270 

Thr Leu Ala Thr Pro Lys Arg Ser Gly Asn Asn Asn Gly Ser Glu 

275 280 285 

Pro Ser Asp lie lie lie Pro Leu Arg Thr Ala Asp Ser Val Phe 

290 295 300 

Cys Pro His Tyr Glu Lys Val Ser Gly Asp Tyr Gly His Pro Val 

305 310 315 

Tyr lie Val Gin Glu Met Pro Pro Gin Ser Pro Ala Asn lie Tyr 

320 325 330 



Tyr Lys Val 
333 



